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[Title of the Invention] NON-NATURALLY-OCCURRING AMINO 
ACID-CONTAINING PROTEIN EXPRESSION METHOD 
[Claims] 

[Claim 1] An expression method for 

non-naturally-occurring amino acid-containing protein 
comprising: expressing in animal cells: 

(A) a mutant tyrosyl-tRNA synthetase that is a derivative of 
tyrosyl-tRNA synthetase originating from E. coli with an enhanced 
specificity for a non-naturally- occurring tyrosine derivative 
as compared with the specificity for tyrosine; 

(B) suppressor tRNA originating in Bacillus species, Mycoplasma 
species or Staphylococcus species of eubacteria and capable of 
binding with the tyrosine derivative in the presence of the mutant 
tyrosyl tRNA synthetase; and, 

(C) a desired protein gene that has undergone a nonsense mutation 
at a desired site; 

wherein, the tyrosine derivative is incorporated in response 
to the created nonsense codon. 

[Claim 2] The expression method according to claim 1 

wherein the tyrosine derivative is a position 3-substituted 
tyrosine . 

[Claim 3] The expression method according to claim 1 or 

2 wherein the suppressor tRNA of (B) is suppressor tyrosine tRNA 
originating in Bacillus stearothermophilus . 



[Claim 4] The expression method according to any of 

claims 1 to 3 wherein the mutant tyrosyl tRNA synthetase of (A) 
is a mutant TyrRS that has undergone an alteration at the location 
corresponding to tyrosine at position 37 and glutamine at 
position 195 of tyrosyl tRNA synthetase. 

[Claim 5] The expression method according to claim 4 

wherein the mutant tyrosyl tRNA synthetase of (A) is a mutant 
TyrRS in which tyrosine (Y) at position 37 is substituted with 
valine (V), leucine (L) , isoleucine (I) or alanine (A), and 
glutamine (Q) at position 195 is substituted with alanine (A) , 
cysteine (C) , serine (S) or asparagine (N) . 

[Claim 6] The expression method according to any of 

claims 1 to 5 wherein the animal cells are mammalian cells. 

[Claim 7] A non-naturally-occurring amino 

acid-containing protein production method comprising: 
recovering and purifying a protein expressed according to any 
of the methods according to claims 1 to 6. 

[Claim 8] Animal cells containing: 

(A) an expression vector that expresses in animal cells a mutant 
tyrosyl-tRNA synthetase that is a derivative of tyrosyl tRNA 
synthetase from E. coli with an enhanced specificity for a 
non-naturally-occurring tyrosine derivative as compared with the 
specificity for tyrosine; 

(B) an expression vector that expresses in the animal cells a 
suppressor tRNA originating in a Bacillus species, Mycoplasma 



species or Staphylococcus species of eubacteria capable of 
binding with the tyrosine derivative in the presence of the mutant 
tyrosyl tRNA synthetase; and, 

(C) an expression vector that expresses in the animal cells a 
desired protein gene that has undergone a nonsense mutation at 
a desired site; wherein, 

the tyrosine derivative is incorporated at the site of the 
nonsense mutation of the protein. 

[Claim 9] The animal cells according to claim 8 wherein 

the tyrosine derivative is a position 3-substituted tyrosine. 

[Claim 10] The animal cells according to claim 8 or 9 
wherein the suppressor tRNA of (B) is suppressor tyrosine tRNA 
originating in Bacillus stearothermophilus . 

[Claim 11] The animal cells according to any of claims 8 
to 10 wherein the mutant tyrosyl tRNA synthetase of (A) is a mutant 
tyrosyl tRNA synthetase that has undergone an alteration at the 
location corresponding to tyrosine at position 37 and glutamine 
at position 195 of tyrosyl tRNA synthetase. 

[Claim 12] The animal cells according to claim 11 wherein 
the mutant tyrosyl tRNA synthetase of (A) is a mutant TyrRS in 
which tyrosine (Y) at position 37 is substituted with valine (V) , 
leucine (L) , isoleucine (I) or alanine (A), and glutamine (Q) 
at position 195 is substituted with alanine (A) , cysteine (C) , 
serine (S) or asparagine (N) . 

[Claim 13] The animal cells according to any of claims 8 



to 12 that are mammalian cells. 

[Claim 14] DNA having a sequence of SEQ. ID NO. 1. 

[Claim 15] An expression vector capable of being 
expressed from a control sequence recognized in animal cells 
comprising a sequence of SEQ. ID NO. 1. 

[Claim 16] The expression vector according to claim 15 
that carries nine copies of DNAs having the sequence of SEQ. ID 
NO. 1 in the same direction. 
[Detailed Description of the Invention] 
[0001] 

[Technical Field of the Invention] 

The present invention relates to a non-naturally-occurring 
amino acid-incorporated protein expression method by which a 
non-naturally occurring amino acid is incorporated at a desired 
site in a protein, as well as DNA, an expression vector and animal 
cells used for expressing the aforementioned protein. 
[0002] 

[Description of the Related Art] 

Non-naturally-occurring amino acid-containing proteins (to 
also be referred to as alloproteins ) , in which an amino acid 
residue at a desired location in a protein is substituted with 
an amino acid other than the 20 types of amino acids normally 
involved in protein synthesis (to be referred to as 
non-naturally-occurring amino acids) , offer an effective means 
of analyzing the function and structure of proteins. 



[0003] 

Aminoacyl t-RNA synthetases (to be referred to as aaRS) are 
enzymes that specifically bind amino acids and tRNA, and 
excluding a few exceptions, each biological species has 20 types 
of such enzymes corresponding to the 20 types of amino acids, 
respectively, that exist in nature. These aaRS, basically 
existing for each amino acid within cells, determine the types 
of amino acid included in the genetic code. For example, TyrRS, 
which is a kind of aaRS (to simply be referred to as TyrRS) , 
distinguishes tyrosine tRNA (to be referred to as tRNA Tyr ) from 
the tRNA of other amino acids, and binds only tyrosine to tRNA Tyr 
without binding other amino acids. 
[0004] 

Known methods for producing alloproteins in the prior art 
consist of a method that produces alloproteins in Escherichia 
coli (Non-patent Document 1) , and a method that produces 
alloproteins in a cell-free translation system (Non-patent 
Document 2) . 
[0005] 

In order to prepare proteins containing 21 types of amino 
acids, including non-naturally-occurring amino acids, in large 
yield, it is necessary to construct an artificial genetic code 
system by which tRNAs are aminoacylated with non-naturally- 
occurring amino acids by specific aaRSs in a system in which a 
translation reaction takes place. 



It is necessary to find aaRS-tRNA pairs that satisfy the 
following conditions in order to construct such an artificial 
genetic code system: 

(1) the aaRS must be an aaRS mutant that reacts specifically with 
a desired non-naturally occurring amino acid and not with any 
of the ordinary 20 types of amino acids; and, 

(2) the tRNA must be assigned to a codon to which none of the 
ordinary 20 types of amino acids are assigned (such as a nonsense 
codon or 4-base codon) , must be recognized only by the 
aforementioned aaRS mutant specific for a non-naturally- 
occurring amino acid, and must not be recognized by any aaRS of 
the host (orthogonal tRNA) . 

[0006] 

A tRNA molecule that binds a non-naturally-occurring amino 
acid and transports it to a nonsense codon on messenger RNA 
(suppressor tRNA molecule) , and an enzyme that binds a 
non-naturally-occurring amino acid to this suppressor tRNA 
molecule (aaRs ) , can be used to create a genetic code system that 
satisfies these conditions. This mechanism is described in 
Non-patent Document 3 and Non-patent Document 4 . 

Specific artificial genetic code systems based on this 
mechanism have been established in E. coli as well as cell-free 
protein synthesis systems using wheat germ extract. 

Namely, a system for producing protein containing a 
non-naturally-occurring amino acid at an arbitrary specified 



site in E. coli has been reported that consists of 
O-methyltyrosine being specifically inserted corresponding to 
an amber codon by expressing a TyrRS mutant originating in 
Methanococcus jannaschii that has been altered so as to 
specifically aminoacylate O-methyltyrosine, and an amber 
suppressor tRNA in which tyrosine tRNA from the same 
microorganism has been altered (Non-patent Document 3) . 
[0007] 

In addition, aaRS has been developed for producing protein 
containing a non-naturally-occurring amino acid at an arbitrary 
site in wheat germ extract (Non-patent Document 5) . 
[0008] 

In this manner, although alloprotein production systems have 
been developed in E. coli and cell-free protein synthesis systems, 
these cannot be used directly in animal cells. Namely, these 
molecules developed for use in E . coli cannot be used in animal 
cells due to the properties of the molecules themselves . In order 
to synthesize protein containing a non-naturally-occurring amino 
acid at an arbitrary specified site in animal cells, suitable 
suppressor tRNA and aaRS specific for both this suppressor tRNA 
and the non-naturally-occurring amino acid must be developed, 
and the expression of those molecules must be realized in animal 
cells . 
[0009] 

In addition, the aforementioned aaRS used in a cell-free 
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protein synthesis system in wheat germ extract is predicted to 
be able to be used in animal cells in consideration of its 
substrate specificity. However, in order to use this aaRS, it 
is necessary to develop suppressor tRNA to be combined therewith 
along with the expression system for this tRNA. 
[0010] 

[Non-patent Document 1] Koide, et al., Proceedings of the 
National Academy of Science USA, Vol. 85, 1988, pp. 6237-6241 

[Non-patent Document 2] Noren et al., Science Vol. 244, 1989, 
p. 182-8 

[Non-patent Document 3] Wang, et al., Science, Vol. 292, 2001, 
pp. 498-500 

[Non-patent Document 4] Journal of the American Chemical 
Society, Vol. 124, 2002, pp. 1836-1937 

[Non-patent Document 5] Kiga, et al., Proceedings of the 
National Academy of Science USA, Vol. 99, July 23, 2002, pp. 
9715-9723 
[0011] 

[Problems to be Solved by the Invention] 

The object of the present invention is to provide a method 
for expressing non-naturally-occurring amino acid-containing 
protein in animal cells, and to provide DNA, an expression vector 
and animal cells for expressing that protein. 
[0012] 

[Means for Solving the Problem] 



The present invention provides the expression methods 
described below. 

(1) An expression method for non-naturally-occurring amino 
acid-containing protein comprising: expressing in animal cells: 

(A) a mutant TyrRS that is a derivative of TyrRS originating from 
E. coli with an enhanced specificity for a non-naturally- 
occurring tyrosine derivative as compared with the specificity 
for tyrosine (referred to as mutant TyrRS) ; 

(B) suppressor tRNA originating in Bacillus species, Mycoplasma 
species or Staphylococcus species of eubacteria and capable of 
binding with the tyrosine derivative in the presence of the mutant 
TyrRS; and, 

(C) a desired protein gene that has undergone a nonsense mutation 
at a desired site; 

wherein, the tyrosine derivative is incorporated in response 
to the created nonsense codon. 

(2) The expression method according to (1) wherein the tyrosine 
derivative is a position 3-substituted tyrosine. 

(3) The expression method according to (1) or (2) wherein the 
suppressor tRNA of (B) is suppressor tRNA originating in Bacillus 
stearothermophilus . 

(4) The expression method according to any of (1) to (3) wherein 
the mutant TyrRS of (A) is a mutant TyrRS that has undergone an 
alteration at the location corresponding to tyrosine at position 
37 and glutamine at position 195 of TyrRS. 
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(5) The expression method according to (4) wherein the mutant 
TyrRS of (A) is a mutant TyrRS in which tyrosine (Y) at position 
37 is substituted with valine (V), leucine (L) , isoleucine (I) 
or alanine (A) , and glutamine (Q) at position 195 is substituted 
with alanine (A) , cysteine (C) , serine (S) or asparagine (N) . 

(6) The expression method according to any of (1) to (5) wherein 
the animal cells are mammalian cells. 

In addition, the present invention provides the protein 
production method described below. 

(7) A non-naturally-occurring amino acid-containing protein 
production method comprising: recovering and purifying a protein 
expressed according to any of the methods described in (1) to 

(6) . 

In addition, the present invention also provides the animal 
cells described below. 

(8) Animal cells containing: 

(A) an expression vector that expresses in animal cells a mutant 
of TyrRS from E. coli with an enhanced specificity for a 
non-naturally-occurring tyrosine derivative as compared with the 
specificity for tyrosine; 

(B) an expression vector that expresses in the animal cells a 
suppressor tRNA originating in a Bacillus species, Mycoplasma 
species or Staphylococcus species of eubacteria capable of 
binding with the tyrosine derivative in the presence of the mutant 
TyrRS; and, 
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(C) an expression vector that expresses in the animal cells a 
desired protein gene that has undergone a nonsense mutation at 
a desired site; wherein, 

the tyrosine derivative is incorporated at the site of the 
nonsense mutation of the protein. 

(9) The animal cells according to (8) wherein the tyrosine 
derivative is a position 3-substituted tyrosine. 

(10) The animal cells according to (8) or (9) wherein the 
suppressor tRNA of (B) is suppressor tRNA originating in Bacillus 
stearothermophilus . 

(11) The animal cells according to any of (8) to (10) wherein the 
mutant TyrRS of (A) is a mutant TyrRS that has undergone an 
alteration at the location corresponding to tyrosine at position 
37 and glutamine at position 195 of TyrRS. 

(12) The animal cells according to (11) wherein the mutant TyrRS 
of (A) is a mutant TyrRS in which tyrosine (Y) at position 37 
is substituted with valine (V) , leucine (L) , isoleucine (I) or 
alanine (A) , and glutamine (Q) at position 195 is substituted 
with alanine (A) , cysteine (C) , serine (S) or asparagine (N) . 

(13) The animal cells according to any of (8) through (12) that 
are mammalian cells. 

In addition, the present invention provides the DNA and 
expression vectors described below. 

(14) DNA having a sequence of SEQ. ID NO. 1. 

(15) An expression vector capable of being expressed from a 
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control sequence recognized in animal cells comprising a sequence 
of SEQ. ID NO. 1. 

(16) The expression vector according to (15) that carries nine 
copies DNA having the sequence of SEQ. ID NO. 1 in the same 
direction. 
[0013] 

[Embodiments of the Invention] 

The expression method of the present invention is: 

an expression method for non-naturally-occurring amino 

acid-containing protein comprising: expressing in animal cells: 

(A) a mutant TyrRS that is a derivative of TyrRS originating in 
prokaryote with an enhanced specificity for a non-naturally- 
occurring tyrosine derivative as compared with specificity for 
tyrosine; 

(B) suppressor tRNA originating in Bacillus species, Mycoplasma 
species or Staphylococcus species of eubacteria and capable of 
binding with the tyrosine derivative in the presence of the mutant 
TyrRS; and, 

(C) a desired protein gene that has undergone a nonsense mutation 
at a desired site; 

wherein, the tyrosine derivative is incorporated at the 
location of the protein nonsense mutation. 
[0014] 

Next, a detailed description is provided of the mutant TyrRS 
and suppressor tRNA used in the expression method of the present 
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invention . 
[0015] 

(1) Mutant TyrRS 

The mutant TyrRS used in the present invention is a derivative 
of TyrRS form E. coli that specifically recognizes a tyrosine 
derivative, specifically recognizes suppressor tRNA used in 
combination therewith and forms the suppressor tRNA 
aminoacylated with the tyrosine derivative. Examples of E. coli 
include strain K12 and strain B. 

The TyrRS (wild type) originating in E. coli does not react 
with tRNA of eukaryotes, while tRNA originating in E. coli does 
not react with TyrRS of eukaryotes. 

[0016] 

Examples of the aforementioned tyrosine derivative include 
position 3-substituted tyrosine having a substituent at position 
3 of the phenyl group of tyrosine. Examples of position 
3-substituted tyrosine include 3-halogenated tyrosines such as 
3-iodotyrosine and 3-bromotyrosine . These amino acids 
themselves are non-naturally-occurring amino acids having 
physiological activity, or to be used as targets of site-specific 
labels of proteins. Therefore, a protein containing a 
3-halogenated tyrosine, for example, is useful as a research 
material for analyzing the structure and function of this protein, 
while proteins containing other amino acids may have the 
potential for being a target of drug development. 
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[0017] 

The amino acid sequence (SEQ. ID NO. 29) of TyrRS (wild type) 
of E . coli is shown in Fig. 8 using a one-letter symbols for amino 
acids . 

The gene coding for the mutant TyrRS used in the present 
invention can be prepared by substituting the amino acid residues 
probably involved in the recognition of tyrosyl-AMP by the enzyme, 
by a known method of site-directed mutagenesis (described later ) . 
These amino acids are identified in the sequence shown in Fig. 
8 by referring to the amino acids involved in the tyrosyl-AMP 
recognition in a known tertiary structure of the complex between 
TyrRS and tyrosyl-AMP (for example, Brick et al., J. Mol . Biol. 
Vol. 208, p. 83, 1988) . 

A preferable example of a mutant TyrRS used in the present 
invention is a mutant on which specificity for 3-halogenated 
tyrosine is conferred by site-specifically substituting at least 
tyrosine (Y) at position 37 and glutamine (Q) at position 195 
with other amino acids in this sequence (see Non-patent Document 
5) . 

[0018] 

More preferably, a mutant TyrRS can be used in which tyrosine 
(Y) at position 37 is substituted with valine (V) , leucine (L) , 
isoleucine (I) or alanine (A), and glutamine (Q) at position 195 
is substituted with alanine (A) , cysteine (C) , serine (S) or 
asparagine (N) . Results demonstrating that the specificity for 
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3-halogenated tyrosines is enhanced by these mutations are shown 
in Table 1. Table 1 shows the measurement of an aminoacylation 
reaction by detecting inorganic phosphorous using Biomol Green 
(Funakoshi) based on a simplified version of the method of Lloyd, 
et al. that quantifies the amount of inorganic phosphoric acid 
produced by degrading pyrophosphoric acid, which is one of the 
reaction products of the aminoacylation reaction, with 
pyrophosphatase (Lloyd, et al,, Nucleic Acids Research, Vol. 23 
(1995) pp. 2286-2892) . 
[0019] 

[Table 1] 



Activity of TyrRS variants with tyrosine, 3-iodo-L-tyrosine 



Enzyme 


Amino acid 


Enzyme 


Amino acid 


Tyrosine 


3-iodo- 
L-tyrosine 


Tyrosine 


3-iodo- 
L-tyrosine 


Wild Type 
(Y37, 
Q195) 


+++ 




V37 


+++ 


+++ 


A37A195 






I37A195 


+ 


+ 


A37C195 


+ 


++ 


I37C195 






A37N195 






I37N195 






A37S195 


+ 


+ 


I37S195 






V37A195 


+ 


++ 


L37A195 


+ 




V37C195 


+ 


++ 


L37C195 


+ 


+ 


V37N195 


+ 


++ 


L37N195 


+ 




V37S195 


++ 


++ 


L37S195 







+++ : the activity detected with an enzyme concentration of 0 . 05|aM 
++: the activity detected with an enzyme concentration of 0.25jxM 
+ : the activity detected with an enzyme concentration of 0.25|iM, 
although the activity is lower than that represented by "++" 
-: no activity detected with an enzyme concentration of 0.25|aM 

[0020] 

In particular, based on the results of Table 1, the mutant 
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in which position 37 is valine and position 195 is cysteine (to 
be referred to as V37C195), the mutant in which position 37 is 
valine and position 195 is asparagine (referred to as V37N195) , 
the mutant in which position 37 is valine and position 195 is 
alanine (referred to as V37A195) and the mutant in which position 
37 is alanine and position 195 is cysteine (referred to as A37C195) 
are particularly preferable. 
[0021] 

Next, a method using a known genetic manipulation technique 
is preferable for the method used to produce these mutants. For 
example, DNA sequences with desired amino acid replacements are 
amplified with the primers for achieving these replacements, and 
are then concatenated with each other into the full-length gene 
for a desired mutant aaRS . This aaRS may easily be prepared by 
expressing it in host cells, such as E. coli. The primer used 
in this method has 20 to 70 bases and preferably about 20 to 50 
bases. Since this primer is mismatched by 1 to 3 bases with the 
original base sequence prior to alteration, the use of a 
comparatively long primer of, for example, 20 bases or more is 
preferable . 
[0022] 

More specifically, the fragment amplified by PCR according 
to the method described in Document 5 using the following primers 
(1) and (2) , together with E. coli genomic DNA for the PCR template, 
are treated with Ndel and Hindlll, followed by the incorporation 
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of this fragment at the Ndel-Hindlll site of pET26b to produce 
TyrRS expression vector pRT-YRS. The use of E. coli genomic DNA 
for the template means that E . coli cells collected from about 
0.1 mL of £. coli culture are heated for 10 minutes at 95°C before 
being added to the PCR reaction mixture. 
[0023] 

Primer (1) : GGAATTCCATATGGCAAGCAGTAACTTGATTAAACAATTGCAAG (SEQ. 

ID NO. 2) 

[0024] 

Primer (2) : GCCGAAGCTTGTCGACTTTCCAGCAAATCAGACAGTAATTCTTTTTACCG 

(SEQ. ID NO. 3) 

[0025] 

Next, an explanation is provided of an example of a method 
for site-specifically altering amino acids at positions 37 and 
195. 

First, mutants are produced in which only one amino acid is 
substituted at position 37 or position 195. The primers (3) to 
(8) used to produce DNA sequences that encode mutants in which 
one amino acid each is substituted at position 37 and position 
195 are as shown below. 
[0026] 

Primer (3) : AGGATCGAAGCCGCAAGCGAGCGCGATCGGGCCTTGCGCC 

(SEQ. ID NO. 4) 

[0027] 

Primer (4) : AGGATCGAAGCCGCAMNNGAGCGCGATCGGGCCTTGCGCC 
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(SEQ. ID NO. 5) 
[0028] 

M represents C or A, and N represents A, C, G or T. 
[0029] 

Primer (5) : ACGGTGTGGTGCTGTCTATTGGTGGTTCTGACC 

(SEQ. ID NO. 6) 

[0030] 

Primer (6): ACGGTGTGGTGCTGGCAATTGGTGGTTCTGACC 

(SEQ. ID NO. 7) 

[0031] 

Primer (7) : ACGGTGTGGTGCTGAACATTGGTGGTTCTGACC 

(SEQ. ID NO. 8) 

[0032] 

Primer (8) : ACGGTGTGGTGCTGTGCATTGGTGGTTCTGACC 

(SEQ. ID NO. 9) 

[0033] 

Next, mutants are created in which the amino acids at 
positions 37 and 195 are both substituted. 

The DNA sequence coding for the mutant with both 
substitutions is created by an overlap extension using the 
plasmids carrying the above mutant genes each with a single 
substitution at position 37 or 195, and is then inserted between 
the Ndel and BamRI sites of pET-YRS. This overlap extension can 
be performed by PCR with the primers (1) and (9) for amplifying 
two DNA fragments partially overlapped with each other. These 
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fragments are obtained by PCR with primer pairs (1) and (10), 
and (9) and (11) and purified for the use in the overlap extension . 
[0034] 

Primer (1) : GGAATTCCATATGGCAAGCAGTAACTTGATTAAACAATTGCAAG 

(SEQ. ID NO. 2) 

[0035] 

Primer (10) : GATCATCTGGTTAACGGAGAAGTGTTTGCC 

(SEQ. ID NO. 11) 

[0036] 

Primer (9) : TTCTTCGGATCCAACCAGACTGCGCCGCCTTC 

(SEQ. ID NO. 10) 

[0037] 

Primer (11) : GACCTTCCTGTGCGATATTGGCAAAC 

(SEQ. ID NO. 12) 

[0038] 

Each of the full-length mutant DNA fragments obtained in the 
aforementioned process is inserted into the locations of the 
template fragments inside plasmid pET-YRS, and inserted into each 
E. coli BLR (DE3) according to a transformation method complying 
with the method of Hanahan (Hanahan, D. , J. Mol . Bio., 166, 
557-580) with plasmids containing mutant TyrRS gene. Mutant 
TyrRS can then be expressed in E. coli by isolating and culturing 
transf ormants having each plasmid. 
[0039] 

Moreover, the method used to create mutant TyrRS is not 
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limited to the aforementioned method, but rather various genetic 
manipulation technologies can be used, including known point 
mutation technologies and methods for inserting altered 
fragments using restriction endonuclease. 
[0040] 

(2) Suppressor tRNA 

Suppressor tRNA used in combination with the aforementioned 
mutant TyrRS satisfies the requirements of being assigned to a 
nonsense codon, which is no codon assigned to the normal 20 types 
of amino acids, being recognized only by the aforementioned 
non-naturally-occurring amino acid-specific TyrRS mutant, not 
being recognized by any aaRS of the host, and being able to 
expressed in eukaryotic cells. 

Although all of three nonsense codons UAG (amber) , UAA 
(ochre) and UGA (opal) are usable, UAG (amber) is used preferably . 
[0041] 

The inventors of the present invention checked whether 
suppressor tRNA originating in E. coli would be suitable for being 
combined with mutant TyrRS originating in £. coli, and tried to 
express this E. coli suppressor tRNA Tyr in eukaryotic cells. The 
sequence and structure of suppressor tRNA originating in E. coli 
tRNA are already known (M. Sprinzl, etal., Nucleic Acids Research, 
17, 1-172 (1989)). This is shown in Fig. 2. 
[0042] 

In general, the expression of tRNA in eukaryotic cells 
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requires two internal promoters within the tRNA coding sequence, 
and their consensus sequences are known as box A and box B. The 
consensus sequence of box A (TRGCNNAGYNGG; SEQ. ID NO. 13) 
corresponds to TRGCNNAGY at positions 8 to 16 and GG at positions 
18 and 19 of tRNA, while the consensus sequence of box B 
corresponds to GGTTCGANTCC at positions 52 to 62 (SEQ. ID NO. 
14). These positions are encircled in Fig. 2. The consensus 
sequence of box B may also be AGTTCGANTCT (SEQ. ID NO. 20) . 

As shown in Fig. 2, although E. coli suppressor tRNA has a 
box B consensus sequence in its sequence, it does not contain 
a box A consensus sequence. Accordingly, in order to express 
this E. coli suppressor tRNA in eukaryotic cells, U9 and CIO were 
substituted with A and G, respectively, to create a consensus 
sequence of box A (Fig. 2) , and the resulting mismatch base pairs, 
G10 and G25, were corrected by G25C substitution. Whereupon, 
as shown in the comparative examples to be described later, 
suppressor activity failed to be detected even when combined with 
the aforementioned mutant E. coli TyrRS. 
[0043] 

On the other hand, since the suppressor tRNA from Bacillus 
stearothermophilus , another prokaryote, originally has box B and 
box A within its sequence (M. Sprinzl, et al., Nucleic Acids 
Research, 17, 1-172 (1989)) (see Fig. 2), it was thought that 
it could be expressed in eukaryotes without any alterations. 
[0044] 
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Then, B . stearothermophilus suppressor tRNA was cloned to 
a vector for its introduction into animal cells without any 
alterations. It was, in fact, expressed in animal cells (refer 
to the examples described later) , and exhibited suppression 
activity when combined with the aforementioned E . coli TyrRS 

(refer to the examples to be described later) . 

[0045] 

Namely, prokaryotic suppressor tRNA, which is able to retain 
suppressor activity in its form that has a box A sequence and 
box B sequence, can exhibit suppressor activity in eukaryotes 
when combined with the aforementioned E. coli mutant TyrRS, 
[0046] 

Thus, the suppressor tRNA used in the expression method of 
the present invention is suppressor tRNA originating in Bacillus 
species, Mycoplasma species or Staphylococcus species of 
eubacteria and capable of binding with tyrosine derivatives in 
the presence of the aforementioned mutant TyrRS. The sequences 
of these tRNA are described in 

http: //medlib.med. Utah . edu/RNAmods/trnabase or 
http: //www. sf af f . uni-bayreuth . de/~btc914 /search/ . 

These tRNAs have a suppressor tRNA sequence that functions 
in prokaryotes as well as two internal promoter consensus 
sequences that are recognized in eukaryotes, and are capable of 
binding with tyrosine derivatives in the presence of the 
aforementioned mutant TyrRS. Here, "having a suppressor tRNA 
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sequence that functions in prokaryotes" means that they are 
suppressor tRNA originating in a prokaryote and retain an 
anticodon complementary to a nonsense codon (usually an amber 
code (UAG) ) and three-dimensional structure (L-shaped structure) 
for functioning as suppressor tRNA. "Having two internal 
promoter sequences that are recognized in eukaryotes" means that 
these genes contain the aforementioned box A consensus sequence 
(SEQ. ID NO. 13) and box B consensus sequence (SEQ. ID NO. 14 
or SEQ. ID NO. 20) . "Being capable of binding with tyrosine 
derivatives in the presence of the aforementioned mutant TyrRS" 
means that they are suppressor tRNA capable of being recognized 
by mutant TyrRS and of binding with tyrosine derivatives. They 
are suppressor mutants derived from tRNA Tyr that binds with 
tyrosine, and capable of binding with tyrosine derivatives, and 
preferably position 3-substituted tyrosine in the presence of 
mutant TyrRS. 

Examples of suppressor tRNA originating in Bacillus species, 
Mycoplasma species or Staphylococcus species of eubacterial tRNA 
include suppressor tRNA originating in tRNA of Bacillus 
stearothermophilus , suppressor tRNA originating in tRNA of 
Bacillus subtilis (see 

http: //medlib.med. Utah. edu/RNAmods/trnabase/ , registration no. 
DY1540; E.F. Wawrousek, et al., (1984) J. Biol. Chem. 259, 
3694-3702), suppressor tRNA originating in tRNA of Mycoplasma 
caprocolum (see http: //medlib.med. utah. edu/RNAmods/trnabase/, 



24 

registration no. 1140; Y. Andachi, et al., (1987) Proc. Natl. 
Acad. Sci. USA 84, 7398-7402), and suppressor tRNA originating 
in tRNA of Staphylococcus aureus (see 

http : //medlib .med . Utah . edu/RNAmods/trnabase/, registration no . 
DY1480; C. Green (1993) J. Bacteriol. 175, 5091-5096). 
Suppressor tRNA originating in tRNA of Bacillus 
stearothermophilus is used preferably. 
[0047] 

(3) Expression of Mutant TyrRS and Suppressor tRNA in Animal Cells 
Any known expression system can be used to express mutant 
TyrRS in animal cells, examples of which include commercially 
available pCDNA3.1 (Invitrogen) , pAGE107 (Cytotechnology , 33 

(1990)), and pAGE103 (J. Biochem. 101, 1307 (1987)). Suppressor 
tRNA can be expressed in animal cells using any known vector for 
cloning in E. coli, an example of which is pBR322 (Sutcliffe, 
J.G., Proc. Natl. Acad. Sci. USA 75, 3737-3741 (1978)). 

[0048] 

A vector for inducible expression can be used, if necessary, 
for the mutant TyrRS, an example of which is the vector carrying 
a tetracycline-responsible promoter commercially available from 
Clontech or Invitrogen. 
[0049] 

Examples of methods for introducing the vector into the cells 
include electroporation (Chu, Nucl. Acids Res. 15, 1311-1326 
(1987)), calcium phosphate method (Chen, Mol. Cell. Biol. 7, 
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2745-2752 (1987)), and lipofection (Derijard, Cell 7, 1025-1037 
(1994); Lamb, Nature Genetics 5, 22-30 (1993)). 
[0050] 

(4) Proteins into which Non-Naturally-Occurring Amino Acids are 
incorporated 

There are no limitations on the type of protein into which 
a non-naturally-occurring amino acid is incorporated in the 
present invention. This protein may be any protein able to be 
expressed, and may also be a recombinant protein. 
[0051] 

In the present invention, it is necessary to insert a nonsense 
codon at the location where the non-naturally- occurring amino 
acid is incorporated (this nonsense codon is amber codon when 
the suppressor tRNA is an amber suppressor) . A 
non-naturally-occurring amino acid is then specifically 
incorporated at this nonsense codon (amber codon) site. 
[0052] 

Although there are no particular limitations on the method 
for site-specifically inserting a mutation into a protein, and 
known methods can be used, it can be carried out appropriately 
in compliance with, for example, the methods described in 
Hashiomoto-Gotoh, Gene 152, 271-275 (1995), Zoller, Methods 
Enzymol. 100, 468-500 (1983), Kramer, Nucleic Acids Res. 12, 
9441-9465 (1984), Kunkel, Proc. Natl. Acad. Sci. USA 82, 488-492 
(1985) or Cellular Engineering Supplement, "New Cellular 
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Engineering Experimental Protocols", Shujunsha, 241-248 (1993), 
or methods that use the "QuickChange Site-Directed Mutagenesis 
Kit" (Stratagene) . 
[0053] 

Since the present invention enables expression in animal 
cells, it enables non-naturally-occurring amino acids to be 
incorporated in proteins that were not expressed, only expressed 
in small amounts, or were unable to be subjected to modification 
following translation as a result of becoming an active form in 
E. coli or cell-free protein systems. A variety of such proteins 
are known among persons with ordinary skill in the art; examples 
of proteins into which non-naturally-occurring amino acids are 
incorporated include, but are not limited to, human epidermal 
growth factor receptor extracellular domain (H. Ogiso, et al., 
Cell 110, 775-787 (2002)), human Groucho/TLEI protein (L. 
Pickeles, et al., Structure 10, 751-761 (2002)) and rat 
muscle-specific kinase (J. Till, et al., Structure 10, 1187-1196 
(2002) ) . 

In addition, in the method of the present invention, since 
alloproteins are expressed in animal cells, non-naturally- 
occurring amino acids can also be incorporated into glycoproteins 
bound to sugar chains. The expression systems with animal 
cells, included in this invention, are particularly useful for 
preparing the alloproteins with sugar chains added in a native 
and desired pattern, in the case that these proteins have an 
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unnatural pattern for sugar chain addition when prepared by 
eel-free translation systems. 

[0054] 

(5) Host 

Another aspect of the present invention is the animal cells 
that can be employed in the expression method included in the 
present invention and into that the aforementioned aaRS and 
suppressor tRNA, and a protein gene with a nonsense codon at the 
site for a non-naturally-occurring amino acid. 

Mammalian cell lines for that the method to express 
recombinant genes have been established are preferable for the 
host animal cells used in the present invention. Examples of 
useful mammalian host cell lines include Chinese hamster ovary 
(CHO) and COS cells. Particular examples include simian kidney 
CV1 line transformed by SV40 (COS-7, ATCC CRL 1651), human 
embryonic kidney line (293 or subcloned 293 cells for growth in 
suspended media, Graham, etal., J. Gen. Virol., 36: 59 (1977)), 
Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin, Proc. 
Natl. Acad. Sci. USA, 77: 4216 (1980)), mouse Sertoli cells (TM4, 
Mather, Biol. Reprod. , 23: 243-251 (1980)), human lung cells 
(W138, ATCC CCL 75) , human liver cells (Hep G2 HB 8065) , and mouse 
breast cancer (MMT 060562, ATCC CCL 51) . Since the expression 
systems for extragenic genes have been established for these 
hosts, the selection of appropriate host cells is within the 
technical scope of a person with ordinary skill in the art. 
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The expression of extragenic genes in these host cells can 
be carried out according to the methods described in, for example, 
Molecular Cloning, 2nd edition, Cold Spring Harbor Laboratory 
Press (1989) . 
[0055] 

An explanation is provided with reference to Fig. 1 of how 
a non-naturally occurring amino acid can be incorporated into 
proteins according to the aforementioned expression method of 
the present invention. The enclosure shown in Fig. 1 represents 
the cell membrane of a mammalian cell (e.g., Chinese hamster ovary 
cells (CHO cells) ) . Inside the cell (inside the enclosure) , 
suppressor tRNA (indicated as "B.s. tRNA Tyr ") and aaRS (indicated 
as "E. coli mutant TyrRS") are expressed from their respective 
expression systems. The non-naturally-occurring amino acid, 
3-iodo-L-tyrosine, here, is added to the medium (outside the 
enclosure) . 

The 3-iodo-L-tyrosine in the medium is taken up by the cell 
due to the activity of the cell itself, and binds to B.s. tRNA Tyr 
due to the function of E. coli mutant TyrRS. Subsequently, the 
3-iodo-L-tyrosine is transported to ribosomes by B.s. tRNA Tyr 
where this amino acid is used to translate a nonsense codon (here, 
a UAG codon) . In order to produce a protein that contains 
3-iodo-L-tyrosine at a desired position , the codon at this 
position of the protein gene is replaced by UAG. This gene is 
then expressed in the cell. 
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[0056] 

In this manner, the expression method of the present 
invention makes it possible to express in a animal cells a desired 
protein in that a tyrosine derivative has been incorporated at 
a desired position. 

Namely, animal cells having: 

(A) an expression vector that expresses in animal cells mutant 
TyrRS that is a derivative of TyrRS originating in E . coli with 
an enhanced specificity for a non-naturally- occurring tyrosine 
derivative as compared with specificity for tyrosine, 

(B) an expression vector that expresses in the aforementioned 
animal cells suppressor tRNA originating in Bacillus species, 
Mycoplasma species or Staphylococcus species of eubacteria and 
capable of binding with the aforementioned tyrosine derivative 
in the presence of the mutant TyrRS; and, 

(C) a desired protein gene that has undergone a nonsense mutation 
at a desired site; 

are incubated under suitable conditions in a medium suitable for 
the growth of animal cells (e.g., Opti-MEM I (Gibco BRL) in the 
case of CHO cells) to which a desired tyrosine derivative has 
been added. For example, in the case of CHO cells, the 
aforementioned cells are incubated for about 24 hours at a 
temperature of about 37°C. The concentration of a tyrosine 
derivative added to the medium is about 0.1 to 3 mM and preferably 
about 0.3 mM. 



30 

[0057] 

A different aspect of the present invention is a production 
method for a non-naturally-occurring amino acid-containing 
protein, by which alloprotein is expressed according to the 
aforementioned expression method to be recovered and purified. 
The expressed protein can be recovered from medium or a host 
cell lysate. If it is bound to a membrane, then it can be 
separated from the membrane by using a suitable washing solution 

(e.g., Triton-X 100) or by enzymatic severing. The cells can 
be lysed by various physicochemical means such as freeze-thaw 
cycling, ultrasonic treatment, mechanical crushing or using a 
cytolytic agent. 

In the case that the expressed protein forms an insoluble 
aggregation, this aggregate can be solved with a denaturant for 
protein and then subjected to the process of refolding of 
denaturing protein, for example, in a solution without denaturant 
or containing the denaturant in too low a concentration to 
denature the protein, or by dialysis. 

[0058] 

Isolation and purification of the protein can be carried out, 
considering the characteristic properties of the protein 
produced, by using any or some, in combination, of solvent 
extraction, fractional precipitation using organic solvent, 
salting out, dialysis, centrifugal separation, ultrafiltration, 
ion exchange chromatography, gel filtration chromatography, 
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hydrophobic chromatography, affinity chromatography, reverse 
phase chromatography, crystallization, electrophoresis and 
other separation procedures. 
[0059] 

Another aspect of the present invention is DNA having the 

following sequence of SEQ. ID NO. 1. 

AGCGTCCGGTTTTTCTGTGCTGAACCTCAGGGGACGCCGACACACGTACAC 
GTC GGAGGGGTAGCGAAGTGGCTAAACGCGGCGGACTCTAAATCCGCTCCCTTT 
GGGTTCGGCGGTTCGAATCCGTCCCCCTCCA GACAAGTGCGGTTTTTTT 
CTCCAGCTCCCG (SEQ. ID NO. 1) 

[0060] 

This sequence of SEQ. ID NO. 1 is an artificial base sequence 
consisting of a leader sequence of human tRNA gene (bases 1 to 
55 of SEQ. ID NO. 1) , the tRNA gene of B. stearothermophilus with 
a CUA anticodon (underlined portion; bases 56 to 137 of SEQ. ID 
NO. 1) but without the terminal CCA sequence and a transcription 
terminator (bases 138 to 167 of SEQ. ID NO. 1) in this order. 
[0061] 

DNA having the sequence of SEQ. ID NO. 1 can be inserted into 
a replicating vector for cloning (amplification of that DNA) or 
expression. Various types of vectors can be used. Said vector 
can be in the form of a plasmid, cosmid, virus particle or phage. 
Desired nucleic acid sequences can be inserted into said vector 
by various techniques. In general, DNA is inserted at a suitable 
restriction site using technology known in the art. There are 
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no limitations on the use of vector elements; the vector typically 
includes one or more signal sequences, replication origins, 
marker genes and so forth. The construction of a suitable vector 
containing one or more of these elements is a known technology 
among persons with ordinary skill in the art. 
[0062] 

Suppressor tRNA can be expressed in animal cells by 
introducing this expression vector into animal cells. Since a 
vector containing the sequence of SEQ. ID NO. 1 has a box A, box 
B and 5 ? - leader sequence as the regulatory sequences recognized 
in animal cells, when the vector was inserted into animal cells, 
the suppressor tRNA will be expressed in the animal cells by using 
these regulatory sequences when the vector was inserted into 
animal cells. 

Thus, not only can this expression vector be used in the 
aforementioned expression method of a non-naturally-occurring 
amino acid-containing protein, but it can also be used for gene 
therapy for diseases related to nonsense mutations. 
[0063] 

A cloning vector contains a nucleic acid sequence that allows 
replication of that vector in one or more host cells . In addition, 
an expression vector may contain such a sequence for replication. 
Such sequences are well known for various types of bacteria, 
yeasts and viruses. Various types of virus origins (SV40, 
polyoma, adenovirus, VSV or PBV) are useful for cloning vectors 
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in mammalian cells. 
[0064] 

Expression and cloning vectors typically contain selectable 
markers. Typical selection markers include ampicillin 
resistance gene, neomycin resistance gene, zeocin resistance 
gene, DHFR gene and thymidine kinase gene. Suitable host cells 
in the case of using wild-type DHFR are CHO cell lines lacking 
in DHFR activity; this cell line has been produced and grown as 
described in Urlaub, etal., Proc. Natl. Acad. Sci. USA, 77: 4216 
(1980) . 
[0064] 

Other methods, vectors and host cells suitable for protein 
synthesis in cultured recombinant vertebrate cells are described 
in Gething, et al., Nature, 293: 620-625 (1981) and Mantei, et 
al., Nature, 281: 40-46 (1979). 
[0066] 

Gene amplification and expression can be analyzed for a given 
sample, for example, by ordinary southern blotting, northern 
blotting for quantifying transcription of mRNA (Thomas, Proc. 
Natl. Acad. Sci. USA, 77: 5201-5205 (1980)), dot blotting (DNA 
analysis) or in situ hybridization using a labeled probe with 
the sequences provided here. Antibody can also be used capable 
of specifically recognizing double strands including DNA double 
strands, RNA double strands and DNA-RNA hybrid double strands 
or DNA-protein double strands. With these antibodies, which may 
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be labeled, said assay can be carried out in the case the double 
strand is bound to a surface so that the presence of antibody 
bound to this double strand can be detected on this surface. 

Gene expression can also be measured by such an immunological 
method as cell immunohistological staining or these applicable 
to tissue sections, as well as by cell culture or liquid assay 
in order to quantify the expression of a gene product directly. 
Useful antibodies for immunohistological staining of liquid 
samples may be either monoclonal or polyclonal antibodies, and 
they can be produced in any mammal. 
[0067] 
[Examples] 

The following provides a more detailed explanation of the 
present invention based on its examples; although the present 
invention is not limited to these examples. In the present 
example, an Ras protein in which the 32nd codon was substituted 
to UAG is expressed in CHO cells to produce an Ras protein 
containing 3-iodotyrosine at said site. 
[0068] 

(1) Suppressor tRNA 

The base sequence of the B.s. tRNA Tyr (167 bases) used as 

suppressor tRNA is as shown below. 

AGCGTCCGGTTTTTCTGTGCTGAACCTCAGGGGACGCCGACACACGTACAC 

GTC GGAGGGGTAGCGAAGTGGCTAAACGCGGCGGACTCTAAATCCGCTCCCTTT 

GGGTTCGGCGGTTCGAATCCGTCCCCCTCCAGACAAGTGCGGTTTTTTT 
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CTCCAGCTCCCG (SEQ. ID NO. 1) 

This sequence is an artificial base sequence consisting of 
a leader sequence of human tRNA gene (H. van Tol, et al., EMBO 
J. 6, 35-31 (1987)), the tRNA Tyr gene of B . stearothermophilus 
with a CUA anticodon (underlined portion) but without the terminal 
CCA sequence, and a transcription terminator (H. van Tol, et al. , 
EMBO J. 6, 35-31 (1987)) in this order. 
[0069] 

The single-strand DNA that possesses this SEQ. ID NO. 1 was 
obtained in the form of a chemically synthesized product by a 
widely used commercial service (Sigma Genosys Japan) that 
synthesizes PCR primers and other single-strand DNA. A DNA 
fragment amplified by PCR using this DNA as a template and the 
following primers (1) and (2) was cloned by severing with EcoRI 
and Hindlll followed by incorporating at the EcoRI-Hindlll site 
of pBR322. 

Primer (1) : CACAGAATTCTCGGGAGCTGGAGAAAAAAAC (SEQ. ID NO. 21) 
Primer (2) : CACAAAGCTTAGCGCTCCGGTTTTTCTGTG (SEQ. ID NO. 22) 

A gene cluster comprised of nine copies of the suppressor 
tRNA tyr gene for from B. stearothermophilus all in the same 
direction was constructed by the following two steps (Fig. 3) . 
[0070] 

First, PCR as described below was carried out with the GeneAmp 
PCR System 9700 (Applied Biosystems ) using three different primer 
sets 1 to 3 each including primers (1) and (2) . 
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In the first reaction, a fragment was produced having a primer 
binding site pbsl (SEQ. ID NO. 15) upstream of the sequence of 
SEQ. ID NO. 1 and a BstXI-1 site (CCAGCAGACTGG : SEQ . ID NO. 17) 
downstream of this DNA, by carrying out PCR under normal reaction 
conditions using primer (1) of the first set: 

AGCGAGTGTTAACCCTGCCTAGCGCTCCGGTTTTTCTGTG (SEQ. ID NO. 23) 
and primer (2) of the first set: 

ACACACCCAGCAGACTGGCGGGAGCTGGAGAAAAAAAC (SEQ ID NO. 24). 
[0071] 

In the second reaction, a fragment was produced having a 
BstXI-1 site (SEQ. ID NO. 17) upstream of the sequence of SEQ. 
ID NO. 1 and another BstXI site, CCAGCTTCCTGG (BstXI-2; SEQ. ID 
NO. 18) downstream from this sequence, by carrying out PCR under 
normal reaction conditions using primer (1) of the second set: 

ACACACCCAGCAGACTGGAGCGCTCCGGTTTTTCTGTG (SEQ ID NO. 25) 
and primer (2) of the second set: 

ACACACCCAGCTTCCTGGCGGGAGCTGGAGAAAAAAAC (SEQ. ID NO. 26). 
[0072] 

In the third reaction, a fragment was produced having a 
BstXI-2 site (SEQ. ID NO. 18) upstream of the sequence of SEQ. 
ID NO. 1 and the primer binding site pbs2 (SEQ. ID NO. 16 downstream 
of SEQ No. 1, by carrying out PCR under normal reaction conditions 
using primer (1) of the third set: 

ACACACCCAGCTTCCTGGAGCGCTCCGGTTTTTCTGTG (SEQ ID NO. 27) 
and primer (2) of the third set: 



37 

CTGCGCGAATATCGTAGTCGCGGGAGCTGGAGAAAAAAAC (SEQ. ID NO. 28). 
[0073] 

These three PCR products were coupled together using a ligase 
according to known technology to produce a sub-cluster comprised 
of three copies of tRNA gene. 

This sub-cluster was amplified in the form of a sub-cluster 
fragment to which EcoRI site was added to one end and Hindlll 
site was added to the other end using a primer with an EcoRI 
restriction site added to the sequence of pbsl (SEQ. ID NO. 15) 
and a primer with a Hindlll restriction site was added to the 
sequence of pbs2 (SEQ. ID NO. 16) . 
[0074] 

Moreover, a fragment to which Hindlll and EcoRI sites were 
added at the ends, and a sub-cluster fragment to which EcoRI and 
BamHI sites were added to the ends, were similarly produced. 
Finally, three types of sub-clusters were produced having 
different combinations of restriction sites. These 
sub-clusters 1 to 3 were concatenated with each other with ligase, 
and then cloned within EcoRI and BamHI sties of pBR322 (Takara 
Shuzo) to produce plasmid pBstRNA having nine copies of B. 
stearothermophilus suppressor tRNA Tyr gene. 
[0075] 

The structure of the resulting cloned fragment is shown in 
Fig. 3. 

In Fig. 3, the base sequences of pbsl, pbs2, BstX-1 and BstX-2 
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are as shown below. 

pbsl: AGCGAGTGTTAACCCTGCCT (SEQ. ID NO. 15) 
pbs2: CGACTACGATATTCGCGCAG (SEQ. ID NO. 16) 
BstX-1: CCAGCAGACTGG (SEQ. ID NO. 17) 
BstX-2: CCAGCTTCCTGG (SEQ. ID NO. 18) 
Since plasmid pBstRNA has nine copies of B. 

stearothermophilus suppressor tRNA Tyr gene, the amount of B. 

stearothermophilus suppressor tRNA Tyr gene expressed in animal 

cells can be increased, and is useful for expressing alloproteins 

in animal cells. 

[0076] 

(2) Mutant TyrRS 

The base sequence of E. coli mutant TyrRS (to be referred 
to as TyrRS (V37C195) gene is described in Document 5. 

In the aforementioned method, a DNA sequence that encodes 
a single amino acid-substituted form in which a single amino acid 
at position 37 or 195 is substituted was produced using the 
aforementioned primers (3) to (8). Primers (3) and (4) are for 
altering position 37. In addition, primers (5) to (8) are for 
altering position 195. 

Next, two fragments were amplified with primer pair (1) and 
(10) and primer pair (9) and (11), using the plasmid that encodes 
a single amino acid-substituted TyrRS at positions 37 and 195, 
respectively. The two fragments were purified and then coupled 
by PCR using primers (1) and (9) . 
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The PCR amplification product was inserted into the multiple 
cloning site of vector pcDNA4/T0 (Invitrogen) to create plasmid 
pEYSMl. 
[0077] 

(3) Amber Suppression in Mammalian Cells 

Transfection was carried out using 0.5 to 2 jig of DNA for 
each plasmid per 35 mm plate in accordance with the Lipof ect AMINE 
2000 (Gibco BRL) method. Opti-MEM 1 (Gibco BRL) was used for 
the medium. Cell extracts were prepared 24 hours after 
transfection and applied to SDS-PAGE followed by western blotting 
using anti-FLAG2 antibody (Sigma) and the ECL+ Immunodetection 
System (Amersham Pharmacia Biotech) . Band intensity was 
measured using the LAS-lOOOplus image analyzer (Fuji Film) . The 
ras (Am) products and a wild type ras product for comparison (0.5 
jig each) were each purif ied, using anti-FLAG M2 antibody affinity 
gel (Sigma) , from 1 to 5 culture plates (diameter: 100 mm) . This 
was followed by liquid chromatography-electrospray mass analysis 
(LC-MS) and tandem mass analysis sequencing. 
[0078] 

(4) CHO-Y Cells 

In order to create CHO-Y cells that stably maintain TyrRS 
(V37C195) , T-REX-CHO cells that constitut ively produce 
tetracycline repressor protein (Invitrogen) were transfected 
with plasmid pEYSMl . The transf ectant s were selected in medium 
containing 25 jag/ml of zeocin, and the expression of TyrRS 
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(V37C195) in the selected cells in the presence of 1 |ag/ml of 
zeocin was checked to obtain CHO-Y cells. In order to synthesize 
alloprotein, the CHO-Y cells were transfected with a plasmid 
containing ras (Am) and B. stearothermophilus suppressor tRNA Tyr 
gene. Tetracycline (1 jig/ml) and 3-iodo-L-tyrosine (0.3mM) were 
added to the medium 24 hours after transfection and a cell extract 
was prepared after further 2 4 hours. 
[0079] 

(5) Incorporation of 3-iodo-L-tyrosine at Amber Codon 

The following two points were taken into consideration to 
incorporate 3-iodo-L-tyrosine in protein. Firstly, mammalian 
cells have a transport mechanism by which inherent amino acids 
and their various analogs are taken up from the environments. 
Secondly, wild type TyrRSs from eukaryotes and prokaryotes do 
not recognize 3-iodo-L-tyrosine as substrate. Therefore, cells 
do not suffer from the toxicity resulting from at-random 
3-iodo-L-tyrosine incorporation at tyrosine positions. Mutant 
TyrRS that is able to recognize 3-iodo-tyrosine is required to 
bind this amino acid to the suppressor tRNA. 

We therefore used E. coli TyrRS (V37C195), which has been 
reported to efficiently recognize 3-iodo-L-tyrosine (K ca t/K m 
value for amino acid activation = 3.3 x 10 3 /M/s) and recognize 
tyrosine at a sufficiently low efficiency (K ca t/Km value = 3.2 
x 10 2 /M/s) . Moreover, TyrRS (V37C195) activates L-tyrosine at 
an efficiency roughly 10,000 times lower as compared with the 
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K cat /K m value (2.3 x 10 6 /M/s) with respect to the wild type enzyme. 
[0080] 

Figs. 5 is a photograph of western blotting for detecting 
amber suppression. (A) shows western blotting in the absence 
of 3-iodo-L-tyrosine, while (B) shows it in the presence of 
3-iodo-L-tyrosine . The ras (Am) gene was introduced at all lanes . 
The wild type E. coli TyrRS (lane 1 in A and B) or TyrRS (V137C195) 
(lane 2 in A and B) were expressed in CHO cells together with 
B. stearothermophilus suppressor tRNA. The presence or absence 
of 3-iodo-L-tyrosine (IY) and the type of TyrRS expressed (wt 
for TyrRS wild type and mut for TyrRS (V37C195)are indicated. 
[0081] 

As shown in Fig. 5, wild type TyrRS and TyrRS (V37C195) were 
each expressed in CHO cells together with B. stearothermophilus 
suppressor tRNA and ras (Am) gene. These enzymes were expressed 
at the same levels. Although amber suppression was observed for 
both enzymes in the absence of 3-iodo-L-tyrosine (Fig. 5 A) , the 
yield of the ras (Am) product with TyrRS (V37C195) was only 40% 
of the wild type enzyme. This indicates that in the absence of 
competing 3-iodo-L-tyrosine, TyrRS (V37C195) recognizes 
L-tyrosine and incorporates it at the amber location. Next, 
3-iodo-L-tyrosine was added to the medium to a final 
concentration of 0.3 mM (L-tyrosine was present at twice that 
concentration (Fig. 5 B) ) . This concentration of 
3-iodo-L-tyrosine had hardly any effect on cell reproduction. 
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In the presence of 3-iodo-L-tyrosine, the suppression 
efficiency with TyrRS (V37C195) was improved to a level 
comparable to that with the wild type enzyme. Thus, 
3-iodo-L-tyrosine was efficiently taken up by the cells, to be 
incorporated into protein depending on TyrRS (V37C195) . 
[0082] 

(6) Confirmation of Ras Products 

In order to confirm the incorporation of 3-iodo-L-tyrosine 
at the amber location (position 32) , the ras product was degraded 
by Achromobacter protease I (Lys-C) . The resulting degradation 
product in the form of a peptide mixture was analyzed using a 
liquid chromatography-mass analyzer (LC-MS) . Then, the amino 
acid incorporated at position 32 was identified on the basis of 
the specific average mass and the elution time in liquid 
chromatography. The fragment corresponding to the sequence from 
serine at position 17 to lysine at position 42 was called the 
IY and Y fragments according to the presence of iodotyrosine and 
tyrosine, respectively, at position 32. 
[0083] 

Fig. 6 is a graph showing the results of LC-MS analysis of 
the ras and ras (Am) products. 

Fig. 6 A shows the results of liquid chromatography in which 
the ras (Am) fragments (Chart A) and the ras fragments (Chart B) 
were detected based on their UV absorbance. 

Fig. 6 B shows the results of the mass spectrometry for the 
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IY fragment (Charts a and c) and Y fragment (Charts b and d) from 
the ras (Am) product (Charts a and b) and ras product (Charts c 
and d) . The Y fragment is composed of residues 17 to 42 of Ras 
protein ( SALTIQLIQNHFVDEYDPTIEDSYRK) , while the IY fragment has 
3-iodo-L-tyrosine substituting for the underlined Y. 

Fig. 6 C shows the results of tandem mass spectrometry of 
the IY fragment. The partial sequence in the direction of the 
C-terminal from the N-terminal consists of Val, Asp, Glu, 
iodotyrosine and Asp in this order. 
[0084] 

During analysis of the ras (Am) product, the IY fragment was 
strongly observed (Fig. 6 B, Chart A) , indicating efficient 
incorporation of 3-iodo-L-tyrosine. The partial sequence of 
this IY fragment as determined by tandem mass spectrometry 
confirmed that iodotyrosine was actually incorporated at 
position 32 (Fig. 6 C) . On the other hand, it was also shown 
that the Y fragment was not detected (Fig. 6B, Chart b) ; 
3-iodo-L-tyrosine thus inhibited the incorporation of L-tyrosine 
at position 32. Further analysis of the LC-MS data indicated 
that other normal amino acids were not incorporated at this 
position. The incorporation of iodotyrosine at position 32 was 
thus confirmed by LC-MS analysis. In contrast, although the Y 
fragment was observed for the ras(WT) product synthesized in the 
presence of 3-iodo-L-tyrosine (Chart d) , while the IY fragment 
was not observed (Fig. 6 B, Chart c) . Since 3-iodo-L-tyrosine 
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would be incorporated at the location of tyrosine by any 
recognition error, this observation indicates that intrinsic 
TyrRS in CHO cells does not recognize 3-iodo-L-tyrosine, nor does 
TyrRS (V37C195) recognize intrinsic tRNA. 

As shown in Fig. 6 B, the peak of a peptide that contains 
3-iodo-L-tyrosine was detected (Fig. 6 B(a) ) . On the other hand, 
if the UAG codon is assumed to be translated to an amino acid 
other than 3-iodo-L-tyrosine, it is most likely that it would 
be translated to tyrosine; in fact, a peptide that contained 
tyrosine was not detected (Fig. 6B(b)). In addition, peptides 
that could be produced if the UAG codon had been translated to 
another amino acid were not detected at all. 
[0085] 

These results indicate that nearly 100% of the Ras protein 
produced contained 3-iodo-L-tyrosine at the location specified 
by the UAG codon, and that the present invention yields the 
expected effects. 
[0086] 

(7) Conditioned Incorporation of 3-iodo-L-tyrosine 

Controlled by Inducible Expression of Mutant TyrRS 
It has been reported that E. coli GlnRS causes induced 
suppression by being expressed from a tetracycline control 
promoter in mammals. We created a CHO cell line (referred to 
as CHO-YS [sic] (CHO-Y) cells) that stably retain the TyrRS 
(V37C195) gene expressed from another type of 
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tetracycline-controlled promoter. The ras (Am) gene and B. 
stearothermophilus suppressor tRNA gene were then transiently 
introduced into the CHO-YS [sic] (CHO-Y) cells. 
[0087] 

Fig. 7 is a photograph of a western blot of inducible amber 
suppression for the incorporation of 3-iodo-L-tyrosine by Ras 
protein. The ras (Am) gene was introduced into CHO-Y cells. The 
ras (Am) gene was introduced into CHO-Y cells together with B. 
stearothermophilus suppressor tRNA Tyr (lanes 1-3) . Lane 1 
indicates the addition of tetracycline and 3-iodo-L-tyrosine, 
lane 2 the addition of tetracycline without the addition of 
3-iodo-L-tyrosine, and lane 3 the absence of the addition of 
tetracycline or 3-iodo-L-tyrosine. 
[0088] 

As shown in Fig. 7, TyrRS (V37C195) was expressed when 
tetracycline was present in the medium (Fig. 7, lanes 1 and 2), 
but was not expressed in the absence of inducer (lane 3) . The 
level of expression was twice the level of TyrRS (V37C195) 
expressed from a plasmid transiently introduced into the cells. 
The ras (Am) product was detected at a suppression efficiency of 
30% in the presence of both 3-iodo-L-tyrosine and tetracycline 
(lane 1). The quality of the ras (Am) product was indicated as 
being identical to the quality of ras (WT) similarly produced in 
CHO-Y cells, and the quality in the presence of TyrRS (V37C195) 
from plasmid as analyzed by LC-MS. More than 95% of the ras (Am) 
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product contained 3-iodo-L-tyrosine at the amber position, while 
3-iodo-L-tyrosine was not detected in the ras(WT) product, 
[0089] 

On the other hand, the ras (Am) product was hardly detected 
in the absence of 3-iodo-L-tyrosine (lane 2) , and was not detected 
in the absence of inducer (lane 3) . These observations indicated 
that the incorporation of 3-iodo-L-tyrosine into the ras (Am) 
product is effectively conditioned by means of the induction of 
the expression of TyrRS (V37C195) by tetracycline. A comparison 
of Fig. 7 and Fig. 5A indicates that the incorporation of 
L-tyrosine in the absence of 3-iodo-L-tyrosine is remarkably 
lower than in the case of having expressed TyrRS (V37C195) from 
a plasmid. This unexpected result was observed in three or more 
independent observations. The mechanism on which this 
phenomenon is based is a subject for further research. 
[0090] 

[Reference Example] 

Construction of TyrRS Gene and Reporter Gene 

An FLAG tag (DYKDDDDK) was added to the C terminals of mutant 
TyrRS (V37C195) gene, ras gene and epidermal growth factor 
receptor reporter gene by amplifying these genes with suitable 
PCR primers. The PCR products were respectively cloned to vector 
pcDNA3 . 1/Zeo (+) (Invitrogen) in order to express them in 
mammalian cells. With respect to TyrRS (V37C195) , the PCR 
product was also cloned to vector pcDNA4/TO (Invitrogen) to 
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produce plasmid pRYSMl for tetracycline-controlled expression. 
Site-directed mutagenesis of the ras gene was carried out by PCR 
using a mutation-inducing primer. Similarly, the tyrosine codon 
at position 1068 of epidermal growth factor receptor was mutated 
to an amber codon. The first methionine residue of green 
fluorescent protein (cyanof luorescent mutation) (Clontech) was 
substituted with a short peptide encoded by: 

ATGGGAACTAGTCCA TAG TGGTGGAATTCTGCAGATATCCAGCACAGTGGCGGC 
CGCCGGTC (amber codon is underlined) (SEQ. ID NO. 19) 
and a FLAG tag was added to the C terminal. The resulting gene 
was cloned to vector pcDNA3 . 1 (Zeo (+) . The sequence of the 
constructed gene was confirmed using the ABI Prism 377 DNA 
Sequencer (Applied Biosystems) . 
[0091] 

Transfection was carried out in accordance with the 
Lipofect AMINE 2000 (Gibco BRL) method using 0.5 to 2 jag of DNA 
for each of the reporter gene expression vectors, suppressor tRNA 
expression vectors and E. coli TyrRS expression vectors. 
Opti-MEM 1 (Gibco BRL) was used for the medium. Cell extracts 
were prepared 24 hours after transfection and applied to SDS-PAGE 
followed by western blotting using anti-FLAG2 antibody (Sigma) 
and the ECL+ Immunodetection System (Amersham Pharmacia Biotech) . 
Band intensity was measured using the LAS-1000 Plus Image 
Analyzer (Fuji Film) (Figs. 4A and 4B) . 
[0092] 



48 

Fig. 4 is a photograph of the gel of the western blots obtained 
using anti-FLAG antibody in order to detect amber suppression 
of CHO cells. 

In Fig. 4 A, FLAG tags are respectively added to the wild 
type ras gene (Lane 1) and ras (Am) gene (Lanes 2-8), and these 
genes were introduced into the CHO cells. Human suppressor 
tRNA Tyr (Lane 3) or E. coli TyrRS and E. coli suppressor tRNA Tyr 
(Lane 4) were expressed in the CHO cells. B. stearothermophilus 
suppressor tRNA Tyr was expressed in the CHO cells together with 
E. coli TyrRS (Lanes 5 and 8) or in the absence of enzyme (Lane 

7) , and even E . coli TyrRS alone was expressed in the CHO cells 
(Lane 6) . B. stearothermophilus suppressor tRNA Tyr was expressed 
from a plasmid having a single copy of the gene for this tRNA (Lanes 
5 and 7) and from a plasmid having nine copies of the gene (Lane 

8) . Lane 1 contained 2.5 (ig of cell extract, while lanes 2 to 
8 contained four times that amount of cell extract. 

In Fig. 4 B, genes containing the amber codon at position 
1068 (Lanes 1-3) as well as wild type EGFR gene (Lane 2) and 
epidermal growth factor receptor (EGFR) gene to which FLAG tags 
were respectively added were introduced into CHO cells. The B. 
stearothermophilus suppressor tRNA Tyr and E. coli TyrRS pair was 
expressed in the cells (Lane 3) . The weak bands that migrated 
at the level of E. coli TyrRS in lanes 1 and 2 correspond to 
endogenous proteins that reacted to anti-FLAG antibody. 
[0093] 
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As shown in Fig. 4, the co-expression of B. 
stearothermophilus suppressor tRNA and E. coli TyrRS was found 
to cause suppression of amber mutation in ras (Am) at an efficiency 
of 15% (Fig. 4 A, Lane 5) . This suppression did not occur in 
the absence of this tRNA (Lane 6) or enzyme (Lane 7) . The need 
for E. coli TyrRS for suppression indicates that B. 
stearothermophilus suppressor tRNA Tyr is not aminoacylated by 
endogenous aaRS in cells. On the other hand, the expression of 
E . coli TyrRS from a transiently introduced plasmid itself hardly 
affect the growth rate of CHO cells. 
[0094] 

The suppression efficiency of the tRNA-TyrRS pair was 
considerably lower than the efficiency of human suppressor tRNA 
or other suppressor tRNAs that act in mammalian cells (20 to 40%) . 
Since suppression efficiency was thought to be improved by 
increasing the copy number of suppressor tRNA gene, a plasmid 
was constructed that had nine copies of B. stearothermophilus 
suppressor tRNA gene, and this plasmid was introduced into CHO 
cells along with a plasmid having E . coli TyrRS gene. As a result, 
suppression efficiency was improved to 24% in this manner (Fig. 
4A, Lane 8) . This value is comparable to that of human suppressor 
tRNA. This plasmid was subsequently used to express B. 
stearothermophilus suppressor tRNA. 
[0095] 

The pair consisting of B. stearothermophilus suppressor tRNA 
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and E. coli TyrRS caused suppression of amber mutation in human 
embryonic kidney cell line 293 at the same efficiency as that 
in CHO cells. Moreover, amber mutation of epidermal growth 
factor receptor gene was suppressed at an efficiency of 20% (Fig. 
4B) , while amber mutation in Aequorea victoria green fluorescent 
protein gene was suppressed with the same efficiency. 

These genes and ras (Am) gene have different codons 
surrounding the amber codon. 
[0096] 

[Comparative Example] 

Need for Expression of Prokaryotic tRNA-TyrRS Pair for Amber 
Suppression in Mammalian Cells 

Expression of tRNA in eukaryotes requires two internal 
promoters (boxes A and B) within an RNA coding sequence. Since 
E. coli tRNA only contains box B, box A was created by 
substituting U9 and CIO with A and G, respectively. As a result, 
the resulting mismatch base pair, G10-G25, was corrected by 
substituting G25 with C. (to be referred to as tRNA Tyr (A9G10C25) . 
Positions 9, 10 and 25 of E. coli tRNA Tyr are involved in 
three-dimensional interaction and support an L-shaped structure. 
[0097] 

The sequence of tRNA Tyr (A9G10C25) having a CUA anticodon was 
coupled to the 5 '-flanking sequence of human tRNA gene. Human 
suppressor tRNA gene was constructed in the same manner as the 
control of amber suppression. In order to analyze amber 
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suppression, the tyrosine codon at position 32 in a synthetic 
ras gene truncated from the wild type c-Ha-Ras was mutated to 
an amber codon. In order to detect expression, a FLAG peptide 
tag was added to the C-terminals of the ras genes, ras(WT) and 
its amber mutant ras (Am) , and to the Oterminal of E. coli TyrRS. 
[0098] 

These ras gene were introduced into CHO cells, and their 
products were detected by western blotting of cell extracts using 
anti-FLAG antibody (Fig. 4) . Although expression of ras(WT) gene 
was detected in the absence of suppressor tRNA (Lane 1), 
expression of ras (Am) was not detected (Lane 2) . This indicates 
that intrinsic suppressor activity is lacking in the cells. 
Human suppressor tRNA caused suppression of amber mutation in 
ras (Am) at an efficiency of 26% as measured on the basis of band 
intensity (Lane 3) . On the other hand, E. coli tRNA (A9G10C25) , 
which has a CUA anticodon, fail to cause suppression together 
with wild type TyrRS from E. coli (Lane 4) . Next, we investigated 
another E. coli suppressor tRNA mutant using another nucleotide 
set capable of forming box A (G9G10C25) . This tRNA was also 
unable to cause suppression. In this manner, the formation of 
box A in E. coli suppressor tRNA impaired its activity. This 
is probably because E. coli tRNA with box A fail to maintain the 
three-dimensional structure resulting in impairments of tRNA 
maturation or aminoacylation . 
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[0099] 

[Effects of the Invention] 

According to the expression method included in the present 
invention, non-naturally-occurring amino acids can be 
incorporated into a desired protein in response to nonsense 
codons, in the animal cells expressing the aforementioned mutant 
of E. coli TyrRS, the aforementioned suppressor tRNA derived from 
eubacterias, such as Bacillus, Mycoplasma, or Staphylococcus, 
and the gene coding for the desired protein with nonsense codons 
at desired positions. The non-naturally-occurring amino acids 
are taken up from the growth medium by the cells. 

Previously, only E. coli cells were available for producing 
alloproteins , the proteins with non-naturally-occurring amino 
acids at desired positions. The present invention has enabled 
the production of such proteins in animal cells. Animal cells 
are better systems than E. coli cells for preparing the proteins 
from animals, including the human being, and would thus be useful 
for preparing alloproteins that are difficult to prepare with 
E. coli cells. 

There are various merits in incorporating 
non-naturally-occurring amino acid into a protein. The present 
invention specifically enables the incorporation of iodotyrosine, 
which makes it possible to incorporate heavy atom into a protein 
for X-ray crystallography of the protein and label a protein with 
radioactive iodine atoms. 
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In particular, iodine atoms would facilitate the functional 
and structural analyses of the proteins with iodine atoms, 
because this atom gives a characteristic signal for NMR 
spectroscopy. 

Moreover, proteins incorporated with 3-halogenated tyrosine 
exhibit novel physiological properties, therefore they are 
subjected to the development of new drugs or drug-delivery 
systems . 

In addition, the suppressor tRNA-expressing vector of the 
present invention enables the suppression of nonsense mutation 
in animal cells, therefore, this has a potential utility in the 
gene therapy of disease involved in nonsense mutations. 
[0100] 

SEQUENCE LISTING 

<110> RIKEN; Japan Science and Technology Agency 

<120> Non-naturally-occurring amino acid-containing protein 
expression method 

<150> JP2002-318846 
<151> 2002-10-31 

<160> 29 

<210> 1 
<211> 167 
<212> DNA 

<213> Artificial Sequence 
<400> 1 

agcgctccgg tttttctgtg ctgaacctca ggggacgccg acacacgtac 
acgtcggagg 

ggtagcgaag tggctaaacg cggcggactc taaatccgct ccctttgggt 
tcggcggttc 

gaatccgtcc ccctccagac aagtgcggtt tttttctcca gctcccg 167 
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<210> 2 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<400> 2 

ggaattccat atggcaagca gtaacttgat taaacaattg caag 44 

<210> 3 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<400> 3 

gccgaagctt gtcgactttc cagcaaatca gacagtaatt ctttttaccg 

<210> 4 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<400> 4 

aggatcgaag ccgcaagcga gcgcgatcgg gccttgcgcc 4 0 

<210> 5 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<400> 5 

aggatcgaag ccgcamnnga gcgcgatcgg gccttgcgcc 4 0 

<210> 6 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<400> 6 

acggtgtggt gctgtctatt ggtggttctg acc 33 

<210> 7 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<400> 7 

acggtgtggt gctggcaatt ggtggttctg acc 33 
<210> 8 
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<211> 33 
<212> DNA 

<213> Artificial Sequence. 
<400> 8 

acggtgtggt gctgaacatt ggtggttctg acc 33 

<210> 9 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<400> 9 

acggtgtggt gctgtgcatt ggtggttctg acc 33 

<210> 10 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<400> 10 

ttcttcggat ccaaccagac tgcgccgcct tc 32 

<210> 11 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<400> 11 

gatcatctgg ttaacggaga agtgtttgcc 30 

<210> 12 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<400> 12 

gaccttcctg tgcgatattg gcaaac 26 

<210> 13 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<400> 13 

trgcnnagyn gg 12 

<210> 14 
<211> 11 
<212> DNA 
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<213> Artificial Sequence 
<400> 14 

ggttcgantc c 11 

<210> 15 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<400> 15 

agcgagtgtt aaccctgcct 20 

<210> 16 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<400> 16 

cgactacgat attcgcgcag 20 

<210> 17 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<400> 17 

ccagcagact gg 12 

<210> 18 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<400> 18 

ccagcttcct gg 12 

<210> 19 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<400> 19 

atgggaacta gtccatagtg gtggaattct gcagatatcc agcacagtgg 

cggccgccgc 

gtc 63 



<210> 20 
<211> 11 
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<212> DNA 

<213> Artificial Sequence 
<400> 20 

agttcgantc t 11 

<210> 21 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<400> 21 

cacagaattc tcgggagctg gagaaaaaaa c 31 

<210> 22 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<400> 22 

cacaaagctt agcgctccgg tttttctgtg 30 

<210> 23 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<400> 23 

agcgagtgtt aaccctgcct agcgctccgg tttttctgtg 

<210> 24 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<400> 24 

acacacccag cagactggcg ggagctggag aaaaaaac 38 

<210> 25 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<400> 25 

acacacccag cagactggag cgctccggtt tttctgtg 38 

<210> 26 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
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<400> 26 

acacacccag cttcctggcg ggagctggag aaaaaaac 38 

<210> 27 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<400> 27 

acacacccag cttcctggag cgctccggtt tttctgtg 38 

<210> 28 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<400> 28 

ctgcgcgaat atcgtagtcg cgggagctgg agaaaaaaac 40 

<210> 29 
<211> 424 
<212> PRT 

<213> Escherichia coli 
<400> 29 

Met Ala Ser Ser Asn Leu lie 
1 5 

Val Ala Gin Val Thr Asp Glu 
20 



Gin Gly Pro lie Ala Leu Tyr 
35 

Ser Leu His Leu Gly His Leu 
50 

Phe Gin Gin Ala Gly His Lys 
65 

Thr Gly Leu lie Gly Asp Pro 
80 

Leu Asn Thr Glu Glu Thr Val 
95 



Lys Gin Leu Gin Glu Arg Gly Leu 
10 15 

Glu Ala Leu Ala Glu Arg Leu Ala 
25 30 



Cys Gly Phe Asp Pro Thr Ala Asp 
40 45 

Val Pro Leu Leu Cys Leu Lys Arg 
55 60 

Pro Val Ala Leu Val Gly Gly Ala 
70 75 

Ser Phe Lys Ala Ala Glu Arg Lys 
85 90 



Gin Glu Trp Val Asp Lys lie Arg 
100 105 



Lys Gin Val Ala Pro Phe Leu Asp Phe Asp Cys Gly Glu Asn Ser 



110 
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115 



120 



Ala lie Ala Ala Asn Asn Tyr Asp Trp Phe Gly Asn Met Asn Val 
125 130 135 

Leu Thr Phe Leu Arg Asp lie Gly Lys His Phe Ser Val Asn Gin 
140 145 150 

Met lie Asn Lys Glu Ala Val Lys Gin Arg Leu Asn Arg Glu Asp 
155 160 165 

Gin Gly lie Ser Phe Thr Glu Phe Ser Tyr Asn Leu Leu Gin Gly 
170 175 180 

Tyr Asp Phe Ala Cys Leu Asn Lys Gin Tyr Gly Val Val Leu Gin 
185 190 195 

lie Gly Gly Ser Asp Gin Trp Gly Asn lie Thr Ser Gly lie Asp 
200 205 210 

Leu Thr Arg Arg Leu His Gin Asn Gin Val Phe Gly Leu Thr Val 
215 220 225 

Pro Leu lie Thr Lys Ala Asp Gly Thr Lys Phe Gly Lys Thr Glu 
230 235 240 

Gly Gly Ala Val Trp Leu Asp Pro Lys Lys Thr Ser Pro Tyr Lys 
245 250 255 

Phe Tyr Gin Phe Trp lie Asn Thr Ala Asp Ala Asp Val Tyr Arg 
260 265 270 

Phe Leu Lys Phe Phe Thr Phe Met Ser lie Glu Glu lie Asn Ala 
275 280 285 

Leu Glu Glu Glu Asp Lys Asn Ser Gly Lys Ala Pro Arg Ala Gin 
290 295 300 

Tyr Val Leu Ala Glu Gin Val Thr Arg Leu Val His Gly Glu Glu 
305 310 315 

Gly Leu Gin Ala Ala Lys Arg lie Thr Glu Cys Leu Phe Ser Gly 
320 325 330 

Ser Leu Ser Ala Leu Ser Glu Ala Asp Phe Glu Gin Leu Ala Gin 
335 340 345 



Asp Gly Val Pro Met Val Glu Met Glu Lys Gly Ala Asp Leu Met 
350 355 360 
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Gin Ala Leu Val Asp Ser Glu Leu Gin Pro Ser Arg Gly Gin Ala 
365 370 375 

Arg Lys Thr lie Ala Ser Asn Ala lie Thr lie Asn Gly Glu Lys 
380 385 390 

Gin Ser Asp Pro Glu Tyr Phe Phe Lys Glu Glu Asp Arg Leu Phe 
395 400 405 

Gly Arg Phe Thr Leu Leu Arg Arg Gly Lys Lys Asn Tyr Cys Leu 
410 415 ^ 420 

lie Cys Trp Lys 
424 

[Brief Description of the Drawings] 

[Fig. 1] This is an explanatory drawing showing a mammalian 
cell system for incorporating 3-iodo-L-tyrosine in response to 
an amber codon into a protein gene. Although the 
3-iodo-L-tyrosine (IY) is also present with L-tyrosine (Y) in 
the medium, it binds to any B.s. tRNA (Bacillus 
stearothermophilus tRNA) due to a specific E. coli TyrRS 
introduced into the cells. 

[Fig. 2] This shows the sequence and structure of a 
suppressor tRNA originating in E. coli tRNA and suppressor tRNA 
originating in Bacillus stearothermophilus tRNA. 

[Fig. 3] This is a drawing showing the structure of a plasmid 
having nine copies of Bacillus stearothermophilus suppressor 
tRNA in series. 

[Fig. 4] This is a photograph of the gels obtained from 
western blotting using anti-FLAG antibody for detecting an amber 
suppression in CHO cells. 

[Fig. 5] This is a photograph of western blots for detecting 
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amber suppression . 

[Fig. 6] This is a graph showing the results of LC-MS 
analysis of ras and ras (Am) products. 

[Fig. 7] This is a photograph of a western blot of inducible 
amber suppression for the incorporation of 3-iodo-L-tyrosine 
into Ras protein. 

[Fig. 8] This is a drawing showing an amino acid sequence 
(one-letter sequence code) of TyrRS (wild type) of E. coli. 
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[Name of Document] Abstract 
[Abstract] 

[Problems] To provide an expression system of alloproteins in 
animal cells. 

[Means for Solving] An expression method for 
non-naturally-occurring amino acid-containing protein 
comprising: expressing in animal cells : (A) a mutant tyrosyl-tRNA 
synthetase that is a mutation of tyrosyl-tRNA synthetase 
originating in E. coli with an enhanced specificity for a 
non-naturally-occurring tyrosine derivative as compared with the 
specificity for tyrosine; (B) suppressor tRNA originating in 
Bacillus species, Mycoplasma species or Staphylococcus species 
of eubacteria and capable of binding with the tyrosine derivative 
in the presence of the mutant tyrosyl tRNA synthetase; and, (C) 
a desired protein gene that has undergone a nonsense mutation 
at a desired site; wherein, the tyrosine derivative is 
incorporated at the location of this nonsense mutation. 
[Selected Figure] None 



